Dataset statistics
| Number of variables | 28 |
|---|---|
| Number of observations | 759609 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 19554 |
| Duplicate rows (%) | 2.6% |
| Total size in memory | 169.0 MiB |
| Average record size in memory | 233.3 B |
Variable types
| Numeric | 19 |
|---|---|
| Categorical | 6 |
| Boolean | 3 |
moon_clearance_complete has constant value "False" | Constant |
| Dataset has 19554 (2.6%) duplicate rows | Duplicates |
company_location has a high cardinality: 159 distinct values | High cardinality |
id_x is highly overall correlated with shuttle_id | High correlation |
engines is highly overall correlated with shuttle_type and 3 other fields | High correlation |
passenger_capacity is highly overall correlated with engine_type and 2 other fields | High correlation |
crew is highly overall correlated with engines and 1 other fields | High correlation |
price is highly overall correlated with engines and 2 other fields | High correlation |
company_id is highly overall correlated with shuttle_location and 6 other fields | High correlation |
shuttle_id is highly overall correlated with id_x | High correlation |
review_scores_rating is highly overall correlated with review_scores_comfort and 5 other fields | High correlation |
review_scores_comfort is highly overall correlated with review_scores_rating and 4 other fields | High correlation |
review_scores_amenities is highly overall correlated with review_scores_rating and 4 other fields | High correlation |
review_scores_trip is highly overall correlated with review_scores_rating and 4 other fields | High correlation |
review_scores_crew is highly overall correlated with review_scores_rating and 5 other fields | High correlation |
review_scores_price is highly overall correlated with review_scores_rating and 5 other fields | High correlation |
number_of_reviews is highly overall correlated with reviews_per_month | High correlation |
reviews_per_month is highly overall correlated with number_of_reviews | High correlation |
id_y is highly overall correlated with shuttle_location and 6 other fields | High correlation |
total_fleet_count is highly overall correlated with shuttle_location and 4 other fields | High correlation |
d_check_complete is highly overall correlated with company_id and 2 other fields | High correlation |
iata_approved is highly overall correlated with d_check_complete and 2 other fields | High correlation |
review_scores_location is highly overall correlated with review_scores_rating and 2 other fields | High correlation |
shuttle_location is highly overall correlated with engine_type and 3 other fields | High correlation |
shuttle_type is highly overall correlated with engines and 3 other fields | High correlation |
engine_type is highly overall correlated with shuttle_location and 3 other fields | High correlation |
engines has 40534 (5.3%) zeros | Zeros |
Reproduction
| Analysis started | 2022-11-24 14:46:49.926816 |
|---|---|
| Analysis finished | 2022-11-24 14:50:51.665598 |
| Duration | 4 minutes and 1.74 second |
| Software version | pandas-profiling vv3.5.0 |
| Download configuration | config.json |
id_x
Real number (ℝ)
| Distinct | 29768 |
|---|---|
| Distinct (%) | 3.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38504.629 |
| Minimum | 4 |
|---|---|
| Maximum | 77095 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 4054 |
| Q1 | 19107 |
| median | 39049 |
| Q3 | 58363 |
| 95-th percentile | 72358 |
| Maximum | 77095 |
| Range | 77091 |
| Interquartile range (IQR) | 39256 |
Descriptive statistics
| Standard deviation | 22169.907 |
|---|---|
| Coefficient of variation (CV) | 0.57577251 |
| Kurtosis | -1.2108859 |
| Mean | 38504.629 |
| Median Absolute Deviation (MAD) | 19612 |
| Skewness | -0.017028223 |
| Sum | 2.9248463 × 1010 |
| Variance | 4.9150476 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8692 | 1086 | 0.1% |
| 29483 | 1086 | 0.1% |
| 52116 | 1086 | 0.1% |
| 25234 | 1086 | 0.1% |
| 35034 | 1086 | 0.1% |
| 41758 | 1086 | 0.1% |
| 17273 | 1086 | 0.1% |
| 13084 | 1086 | 0.1% |
| 19472 | 1086 | 0.1% |
| 18120 | 1086 | 0.1% |
| Other values (29758) | 748749 |
| Value | Count | Frequency (%) |
| 4 | 3 | < 0.1% |
| 7 | 8 | |
| 9 | 3 | < 0.1% |
| 11 | 3 | < 0.1% |
| 25 | 1 | < 0.1% |
| 26 | 19 | |
| 28 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 77095 | 1 | < 0.1% |
| 77094 | 60 | |
| 77088 | 14 | < 0.1% |
| 77087 | 4 | < 0.1% |
| 77085 | 14 | < 0.1% |
| 77083 | 11 | < 0.1% |
| 77078 | 1 | < 0.1% |
| 77076 | 7 | < 0.1% |
| 77072 | 2 | < 0.1% |
| 77069 | 53 |
shuttle_location
Categorical
| Distinct | 30 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.6 MiB |
| Barbados | |
|---|---|
| Micronesia | |
| Malta | |
| Nicaragua | |
| Rwanda | |
| Other values (25) |
Length
| Max length | 25 |
|---|---|
| Median length | 18 |
| Mean length | 10.284496 |
| Min length | 4 |
Characters and Unicode
| Total characters | 7812196 |
|---|---|
| Distinct characters | 44 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Niue |
|---|---|
| 2nd row | Niue |
| 3rd row | Niue |
| 4th row | Niue |
| 5th row | Niue |
Common Values
| Value | Count | Frequency (%) |
| Barbados | 120776 | |
| Micronesia | 109398 | |
| Malta | 106708 | |
| Nicaragua | 92759 | |
| Rwanda | 69383 | |
| Russian Federation | 58490 | |
| Sao Tome and Principe | 50293 | |
| United Kingdom | 28488 | 3.8% |
| Niue | 26269 | 3.5% |
| Bouvet Island (Bouvetoya) | 23346 | 3.1% |
| Other values (20) | 73699 |
Length
| Value | Count | Frequency (%) |
| barbados | 120776 | 11.1% |
| micronesia | 109398 | 10.1% |
| malta | 106708 | 9.8% |
| nicaragua | 92759 | 8.5% |
| rwanda | 69383 | 6.4% |
| and | 65920 | 6.1% |
| russian | 58490 | 5.4% |
| federation | 58490 | 5.4% |
| sao | 50293 | 4.6% |
| tome | 50293 | 4.6% |
| Other values (34) | 303926 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1365388 | |
| i | 699351 | 9.0% |
| n | 556932 | 7.1% |
| o | 511438 | 6.5% |
| e | 457654 | 5.9% |
| r | 448070 | 5.7% |
| s | 420122 | 5.4% |
| d | 410804 | 5.3% |
| 326827 | 4.2% | |
| t | 278741 | 3.6% |
| Other values (34) | 2336869 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6419303 | |
| Uppercase Letter | 1019374 | 13.0% |
| Space Separator | 326827 | 4.2% |
| Open Punctuation | 23346 | 0.3% |
| Close Punctuation | 23346 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1365388 | |
| i | 699351 | |
| n | 556932 | |
| o | 511438 | 8.0% |
| e | 457654 | 7.1% |
| r | 448070 | 7.0% |
| s | 420122 | 6.5% |
| d | 410804 | 6.4% |
| t | 278741 | 4.3% |
| u | 265663 | 4.1% |
| Other values (14) | 1005140 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 217945 | |
| B | 168774 | |
| R | 131930 | |
| N | 119028 | |
| F | 77298 | 7.6% |
| S | 51370 | 5.0% |
| P | 50514 | 5.0% |
| T | 50293 | 4.9% |
| I | 39155 | 3.8% |
| K | 38419 | 3.8% |
| Other values (7) | 74648 | 7.3% |
Space Separator
| Value | Count | Frequency (%) |
| 326827 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 23346 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 23346 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7438677 | |
| Common | 373519 | 4.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1365388 | |
| i | 699351 | 9.4% |
| n | 556932 | 7.5% |
| o | 511438 | 6.9% |
| e | 457654 | 6.2% |
| r | 448070 | 6.0% |
| s | 420122 | 5.6% |
| d | 410804 | 5.5% |
| t | 278741 | 3.7% |
| u | 265663 | 3.6% |
| Other values (31) | 2024514 |
Common
| Value | Count | Frequency (%) |
| 326827 | ||
| ( | 23346 | 6.3% |
| ) | 23346 | 6.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7812196 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1365388 | |
| i | 699351 | 9.0% |
| n | 556932 | 7.1% |
| o | 511438 | 6.5% |
| e | 457654 | 5.9% |
| r | 448070 | 5.7% |
| s | 420122 | 5.4% |
| d | 410804 | 5.3% |
| 326827 | 4.2% | |
| t | 278741 | 3.6% |
| Other values (34) | 2336869 |
shuttle_type
Categorical
| Distinct | 32 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.6 MiB |
| Type V5 | |
|---|---|
| Type F5 | |
| Type G0 | 48629 |
| Type V2 | 8887 |
| Type Z6 | 2779 |
| Other values (27) | 7367 |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 5317263 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Type V5 |
|---|---|
| 2nd row | Type V5 |
| 3rd row | Type V5 |
| 4th row | Type V5 |
| 5th row | Type V5 |
Common Values
| Value | Count | Frequency (%) |
| Type V5 | 504840 | |
| Type F5 | 187107 | 24.6% |
| Type G0 | 48629 | 6.4% |
| Type V2 | 8887 | 1.2% |
| Type Z6 | 2779 | 0.4% |
| Type O3 | 1876 | 0.2% |
| Type V7 | 1500 | 0.2% |
| Type N0 | 1323 | 0.2% |
| Type X3 | 547 | 0.1% |
| Type E3 | 474 | 0.1% |
| Other values (22) | 1647 | 0.2% |
Length
| Value | Count | Frequency (%) |
| type | 759609 | |
| v5 | 504840 | |
| f5 | 187107 | 12.3% |
| g0 | 48629 | 3.2% |
| v2 | 8887 | 0.6% |
| z6 | 2779 | 0.2% |
| o3 | 1876 | 0.1% |
| v7 | 1500 | 0.1% |
| n0 | 1323 | 0.1% |
| x3 | 547 | < 0.1% |
| Other values (23) | 2121 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 759636 | |
| y | 759609 | |
| p | 759609 | |
| e | 759609 | |
| 759609 | ||
| 5 | 692259 | |
| V | 515227 | |
| F | 187456 | 3.5% |
| 0 | 49971 | 0.9% |
| G | 48629 | 0.9% |
| Other values (23) | 25649 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2278827 | |
| Uppercase Letter | 1519218 | |
| Space Separator | 759609 | 14.3% |
| Decimal Number | 759609 | 14.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 759636 | |
| V | 515227 | |
| F | 187456 | 12.3% |
| G | 48629 | 3.2% |
| Z | 2832 | 0.2% |
| O | 1884 | 0.1% |
| N | 1323 | 0.1% |
| X | 547 | < 0.1% |
| E | 474 | < 0.1% |
| W | 287 | < 0.1% |
| Other values (11) | 923 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 692259 | |
| 0 | 49971 | 6.6% |
| 2 | 8893 | 1.2% |
| 3 | 2897 | 0.4% |
| 6 | 2799 | 0.4% |
| 7 | 2225 | 0.3% |
| 1 | 510 | 0.1% |
| 4 | 55 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| y | 759609 | |
| p | 759609 | |
| e | 759609 |
Space Separator
| Value | Count | Frequency (%) |
| 759609 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3798045 | |
| Common | 1519218 | 28.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 759636 | |
| y | 759609 | |
| p | 759609 | |
| e | 759609 | |
| V | 515227 | |
| F | 187456 | 4.9% |
| G | 48629 | 1.3% |
| Z | 2832 | 0.1% |
| O | 1884 | < 0.1% |
| N | 1323 | < 0.1% |
| Other values (14) | 2231 | 0.1% |
Common
| Value | Count | Frequency (%) |
| 759609 | ||
| 5 | 692259 | |
| 0 | 49971 | 3.3% |
| 2 | 8893 | 0.6% |
| 3 | 2897 | 0.2% |
| 6 | 2799 | 0.2% |
| 7 | 2225 | 0.1% |
| 1 | 510 | < 0.1% |
| 4 | 55 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5317263 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| T | 759636 | |
| y | 759609 | |
| p | 759609 | |
| e | 759609 | |
| 759609 | ||
| 5 | 692259 | |
| V | 515227 | |
| F | 187456 | 3.5% |
| 0 | 49971 | 0.9% |
| G | 48629 | 0.9% |
| Other values (23) | 25649 | 0.5% |
engine_type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.6 MiB |
| Plasma | |
|---|---|
| Quantum | |
| Nuclear | 6683 |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 6.1183635 |
| Min length | 6 |
Characters and Unicode
| Total characters | 4647564 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Quantum |
|---|---|
| 2nd row | Quantum |
| 3rd row | Quantum |
| 4th row | Quantum |
| 5th row | Quantum |
Common Values
| Value | Count | Frequency (%) |
| Plasma | 669699 | |
| Quantum | 83227 | 11.0% |
| Nuclear | 6683 | 0.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| plasma | 669699 | |
| quantum | 83227 | 11.0% |
| nuclear | 6683 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1429308 | |
| m | 752926 | |
| l | 676382 | |
| P | 669699 | |
| s | 669699 | |
| u | 173137 | 3.7% |
| Q | 83227 | 1.8% |
| n | 83227 | 1.8% |
| t | 83227 | 1.8% |
| N | 6683 | 0.1% |
| Other values (3) | 20049 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3887955 | |
| Uppercase Letter | 759609 | 16.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1429308 | |
| m | 752926 | |
| l | 676382 | |
| s | 669699 | |
| u | 173137 | 4.5% |
| n | 83227 | 2.1% |
| t | 83227 | 2.1% |
| c | 6683 | 0.2% |
| e | 6683 | 0.2% |
| r | 6683 | 0.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 669699 | |
| Q | 83227 | 11.0% |
| N | 6683 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4647564 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1429308 | |
| m | 752926 | |
| l | 676382 | |
| P | 669699 | |
| s | 669699 | |
| u | 173137 | 3.7% |
| Q | 83227 | 1.8% |
| n | 83227 | 1.8% |
| t | 83227 | 1.8% |
| N | 6683 | 0.1% |
| Other values (3) | 20049 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4647564 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1429308 | |
| m | 752926 | |
| l | 676382 | |
| P | 669699 | |
| s | 669699 | |
| u | 173137 | 3.7% |
| Q | 83227 | 1.8% |
| n | 83227 | 1.8% |
| t | 83227 | 1.8% |
| N | 6683 | 0.1% |
| Other values (3) | 20049 | 0.4% |
engine_vendor
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.6 MiB |
| ThetaBase Services | |
|---|---|
| Banks, Wood and Phillips | 797 |
| Warwick Technology Multinational | 168 |
| SIT Technology Unlimited | 81 |
| MCW Global | 35 |
Length
| Max length | 32 |
|---|---|
| Median length | 18 |
| Mean length | 18.009663 |
| Min length | 10 |
Characters and Unicode
| Total characters | 13680302 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ThetaBase Services |
|---|---|
| 2nd row | ThetaBase Services |
| 3rd row | ThetaBase Services |
| 4th row | ThetaBase Services |
| 5th row | Banks, Wood and Phillips |
Common Values
| Value | Count | Frequency (%) |
| ThetaBase Services | 758528 | |
| Banks, Wood and Phillips | 797 | 0.1% |
| Warwick Technology Multinational | 168 | < 0.1% |
| SIT Technology Unlimited | 81 | < 0.1% |
| MCW Global | 35 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| thetabase | 758528 | |
| services | 758528 | |
| banks | 797 | 0.1% |
| wood | 797 | 0.1% |
| and | 797 | 0.1% |
| phillips | 797 | 0.1% |
| technology | 249 | < 0.1% |
| warwick | 168 | < 0.1% |
| multinational | 168 | < 0.1% |
| sit | 81 | < 0.1% |
| Other values (3) | 151 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3034442 | |
| a | 1519189 | |
| s | 1518650 | |
| 761452 | 5.6% | |
| i | 760788 | 5.6% |
| h | 759574 | 5.6% |
| B | 759325 | 5.6% |
| t | 758945 | 5.5% |
| c | 758945 | 5.5% |
| T | 758858 | 5.5% |
| Other values (23) | 2290134 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10639029 | |
| Uppercase Letter | 2279024 | 16.7% |
| Space Separator | 761452 | 5.6% |
| Other Punctuation | 797 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3034442 | |
| a | 1519189 | |
| s | 1518650 | |
| i | 760788 | 7.2% |
| h | 759574 | 7.1% |
| t | 758945 | 7.1% |
| c | 758945 | 7.1% |
| r | 758696 | 7.1% |
| v | 758528 | 7.1% |
| l | 2330 | < 0.1% |
| Other values (11) | 8942 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 759325 | |
| T | 758858 | |
| S | 758609 | |
| W | 1000 | < 0.1% |
| P | 797 | < 0.1% |
| M | 203 | < 0.1% |
| I | 81 | < 0.1% |
| U | 81 | < 0.1% |
| C | 35 | < 0.1% |
| G | 35 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 761452 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 797 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12918053 | |
| Common | 762249 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3034442 | |
| a | 1519189 | |
| s | 1518650 | |
| i | 760788 | 5.9% |
| h | 759574 | 5.9% |
| B | 759325 | 5.9% |
| t | 758945 | 5.9% |
| c | 758945 | 5.9% |
| T | 758858 | 5.9% |
| r | 758696 | 5.9% |
| Other values (21) | 1530641 |
Common
| Value | Count | Frequency (%) |
| 761452 | ||
| , | 797 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13680302 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 3034442 | |
| a | 1519189 | |
| s | 1518650 | |
| 761452 | 5.6% | |
| i | 760788 | 5.6% |
| h | 759574 | 5.6% |
| B | 759325 | 5.6% |
| t | 758945 | 5.5% |
| c | 758945 | 5.5% |
| T | 758858 | 5.5% |
| Other values (23) | 2290134 |
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.0726413 |
| Minimum | 0 |
|---|---|
| Maximum | 12 |
| Zeros | 40534 |
| Zeros (%) | 5.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 12 |
| Range | 12 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.2778244 |
|---|---|
| Coefficient of variation (CV) | 0.61651981 |
| Kurtosis | 0.19454499 |
| Mean | 2.0726413 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.75108798 |
| Sum | 1574397 |
| Variance | 1.6328353 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 260579 | |
| 2 | 222290 | |
| 3 | 118778 | |
| 4 | 81211 | 10.7% |
| 0 | 40534 | 5.3% |
| 5 | 30588 | 4.0% |
| 6 | 4415 | 0.6% |
| 7 | 1141 | 0.2% |
| 8 | 51 | < 0.1% |
| 12 | 9 | < 0.1% |
| Other values (3) | 13 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 40534 | 5.3% |
| 1 | 260579 | |
| 2 | 222290 | |
| 3 | 118778 | |
| 4 | 81211 | 10.7% |
| 5 | 30588 | 4.0% |
| 6 | 4415 | 0.6% |
| 7 | 1141 | 0.2% |
| 8 | 51 | < 0.1% |
| 9 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 12 | 9 | < 0.1% |
| 11 | 3 | < 0.1% |
| 10 | 4 | < 0.1% |
| 9 | 6 | < 0.1% |
| 8 | 51 | < 0.1% |
| 7 | 1141 | 0.2% |
| 6 | 4415 | 0.6% |
| 5 | 30588 | 4.0% |
| 4 | 81211 | |
| 3 | 118778 |
passenger_capacity
Real number (ℝ)
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.7194873 |
| Minimum | 1 |
|---|---|
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 9 |
| Maximum | 20 |
| Range | 19 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.3630074 |
|---|---|
| Coefficient of variation (CV) | 0.50069156 |
| Kurtosis | 0.51704786 |
| Mean | 4.7194873 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.77995214 |
| Sum | 3584965 |
| Variance | 5.5838042 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 235339 | |
| 2 | 145575 | |
| 6 | 114644 | |
| 8 | 66212 | 8.7% |
| 5 | 47306 | 6.2% |
| 3 | 42218 | 5.6% |
| 7 | 35721 | 4.7% |
| 1 | 23129 | 3.0% |
| 10 | 20230 | 2.7% |
| 9 | 19306 | 2.5% |
| Other values (7) | 9929 | 1.3% |
| Value | Count | Frequency (%) |
| 1 | 23129 | 3.0% |
| 2 | 145575 | |
| 3 | 42218 | 5.6% |
| 4 | 235339 | |
| 5 | 47306 | 6.2% |
| 6 | 114644 | |
| 7 | 35721 | 4.7% |
| 8 | 66212 | 8.7% |
| 9 | 19306 | 2.5% |
| 10 | 20230 | 2.7% |
| Value | Count | Frequency (%) |
| 20 | 6 | < 0.1% |
| 16 | 402 | 0.1% |
| 15 | 62 | < 0.1% |
| 14 | 1455 | 0.2% |
| 13 | 447 | 0.1% |
| 12 | 5695 | 0.7% |
| 11 | 1862 | 0.2% |
| 10 | 20230 | 2.7% |
| 9 | 19306 | 2.5% |
| 8 | 66212 |
cancellation_policy
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.6 MiB |
| strict | |
|---|---|
| moderate | 55883 |
| flexible | 30066 |
Length
| Max length | 8 |
|---|---|
| Median length | 6 |
| Mean length | 6.226298 |
| Min length | 6 |
Characters and Unicode
| Total characters | 4729552 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | strict |
|---|---|
| 2nd row | strict |
| 3rd row | strict |
| 4th row | strict |
| 5th row | strict |
Common Values
| Value | Count | Frequency (%) |
| strict | 673660 | |
| moderate | 55883 | 7.4% |
| flexible | 30066 | 4.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| strict | 673660 | |
| moderate | 55883 | 7.4% |
| flexible | 30066 | 4.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 1403203 | |
| r | 729543 | |
| i | 703726 | |
| s | 673660 | |
| c | 673660 | |
| e | 171898 | 3.6% |
| l | 60132 | 1.3% |
| m | 55883 | 1.2% |
| o | 55883 | 1.2% |
| d | 55883 | 1.2% |
| Other values (4) | 146081 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4729552 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1403203 | |
| r | 729543 | |
| i | 703726 | |
| s | 673660 | |
| c | 673660 | |
| e | 171898 | 3.6% |
| l | 60132 | 1.3% |
| m | 55883 | 1.2% |
| o | 55883 | 1.2% |
| d | 55883 | 1.2% |
| Other values (4) | 146081 | 3.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4729552 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 1403203 | |
| r | 729543 | |
| i | 703726 | |
| s | 673660 | |
| c | 673660 | |
| e | 171898 | 3.6% |
| l | 60132 | 1.3% |
| m | 55883 | 1.2% |
| o | 55883 | 1.2% |
| d | 55883 | 1.2% |
| Other values (4) | 146081 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4729552 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 1403203 | |
| r | 729543 | |
| i | 703726 | |
| s | 673660 | |
| c | 673660 | |
| e | 171898 | 3.6% |
| l | 60132 | 1.3% |
| m | 55883 | 1.2% |
| o | 55883 | 1.2% |
| d | 55883 | 1.2% |
| Other values (4) | 146081 | 3.1% |
crew
Real number (ℝ)
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.6232838 |
| Minimum | 0 |
|---|---|
| Maximum | 20 |
| Zeros | 5757 |
| Zeros (%) | 0.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.6094034 |
|---|---|
| Coefficient of variation (CV) | 0.61350717 |
| Kurtosis | 2.2846916 |
| Mean | 2.6232838 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.198889 |
| Sum | 1992670 |
| Variance | 2.5901794 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 217973 | |
| 1 | 209179 | |
| 3 | 133015 | |
| 4 | 93646 | |
| 5 | 55912 | 7.4% |
| 6 | 26160 | 3.4% |
| 7 | 11789 | 1.6% |
| 0 | 5757 | 0.8% |
| 8 | 2905 | 0.4% |
| 9 | 2508 | 0.3% |
| Other values (8) | 765 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 5757 | 0.8% |
| 1 | 209179 | |
| 2 | 217973 | |
| 3 | 133015 | |
| 4 | 93646 | |
| 5 | 55912 | 7.4% |
| 6 | 26160 | 3.4% |
| 7 | 11789 | 1.6% |
| 8 | 2905 | 0.4% |
| 9 | 2508 | 0.3% |
| Value | Count | Frequency (%) |
| 20 | 26 | < 0.1% |
| 18 | 2 | < 0.1% |
| 16 | 68 | < 0.1% |
| 15 | 21 | < 0.1% |
| 14 | 47 | < 0.1% |
| 12 | 211 | < 0.1% |
| 11 | 12 | < 0.1% |
| 10 | 378 | < 0.1% |
| 9 | 2508 | |
| 8 | 2905 |
d_check_complete
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.5 MiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 563118 | |
| False | 196491 | 25.9% |
moon_clearance_complete
Boolean
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.5 MiB |
| False |
|---|
| Value | Count | Frequency (%) |
| False | 759609 |
price
Real number (ℝ)
| Distinct | 527 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3503.8751 |
| Minimum | 870 |
|---|---|
| Maximum | 86150 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 870 |
|---|---|
| 5-th percentile | 1260 |
| Q1 | 2417 |
| median | 3145 |
| Q3 | 4146 |
| 95-th percentile | 6707 |
| Maximum | 86150 |
| Range | 85280 |
| Interquartile range (IQR) | 1729 |
Descriptive statistics
| Standard deviation | 1866.609 |
|---|---|
| Coefficient of variation (CV) | 0.53272704 |
| Kurtosis | 58.018955 |
| Mean | 3503.8751 |
| Median Absolute Deviation (MAD) | 910 |
| Skewness | 3.3383237 |
| Sum | 2.6615751 × 109 |
| Variance | 3484229.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2430 | 17088 | 2.2% |
| 1130 | 16376 | 2.2% |
| 3457 | 15414 | 2.0% |
| 2170 | 12943 | 1.7% |
| 3015 | 10687 | 1.4% |
| 2820 | 10583 | 1.4% |
| 3470 | 8206 | 1.1% |
| 1910 | 8169 | 1.1% |
| 1195 | 7887 | 1.0% |
| 2677 | 7502 | 1.0% |
| Other values (517) | 644754 |
| Value | Count | Frequency (%) |
| 870 | 193 | < 0.1% |
| 961 | 2 | < 0.1% |
| 974 | 9 | < 0.1% |
| 1000 | 8 | < 0.1% |
| 1013 | 28 | < 0.1% |
| 1026 | 90 | < 0.1% |
| 1039 | 112 | < 0.1% |
| 1052 | 89 | < 0.1% |
| 1065 | 647 | |
| 1078 | 462 |
| Value | Count | Frequency (%) |
| 86150 | 9 | < 0.1% |
| 46370 | 1 | < 0.1% |
| 37270 | 1 | < 0.1% |
| 33370 | 53 | |
| 24920 | 30 | |
| 24270 | 2 | < 0.1% |
| 20370 | 6 | < 0.1% |
| 19720 | 11 | < 0.1% |
| 19070 | 8 | < 0.1% |
| 17120 | 18 | < 0.1% |
company_id
Real number (ℝ)
| Distinct | 15354 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26968.063 |
| Minimum | 4 |
|---|---|
| Maximum | 50094 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 6838 |
| Q1 | 22721 |
| median | 29647 |
| Q3 | 29647 |
| 95-th percentile | 42615 |
| Maximum | 50094 |
| Range | 50090 |
| Interquartile range (IQR) | 6926 |
Descriptive statistics
| Standard deviation | 9058.3928 |
|---|---|
| Coefficient of variation (CV) | 0.33589334 |
| Kurtosis | 1.1296658 |
| Mean | 26968.063 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.66148146 |
| Sum | 2.0485183 × 1010 |
| Variance | 82054480 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 29647 | 383358 | |
| 28828 | 29808 | 3.9% |
| 32203 | 24288 | 3.2% |
| 20334 | 23881 | 3.1% |
| 4745 | 12996 | 1.7% |
| 18077 | 10625 | 1.4% |
| 10711 | 10476 | 1.4% |
| 19019 | 9792 | 1.3% |
| 18459 | 9216 | 1.2% |
| 15004 | 7744 | 1.0% |
| Other values (15344) | 237425 |
| Value | Count | Frequency (%) |
| 4 | 1 | < 0.1% |
| 9 | 4 | |
| 19 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 22 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 26 | 2 | < 0.1% |
| 30 | 1 | < 0.1% |
| 36 | 9 | |
| 41 | 4 |
| Value | Count | Frequency (%) |
| 50094 | 1 | < 0.1% |
| 50089 | 4 | < 0.1% |
| 50085 | 1 | < 0.1% |
| 50080 | 9 | < 0.1% |
| 50078 | 1 | < 0.1% |
| 50074 | 64 | |
| 50072 | 9 | < 0.1% |
| 50071 | 1 | < 0.1% |
| 50070 | 1 | < 0.1% |
| 50063 | 1 | < 0.1% |
shuttle_id
Real number (ℝ)
| Distinct | 29768 |
|---|---|
| Distinct (%) | 3.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38504.629 |
| Minimum | 4 |
|---|---|
| Maximum | 77095 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 4054 |
| Q1 | 19107 |
| median | 39049 |
| Q3 | 58363 |
| 95-th percentile | 72358 |
| Maximum | 77095 |
| Range | 77091 |
| Interquartile range (IQR) | 39256 |
Descriptive statistics
| Standard deviation | 22169.907 |
|---|---|
| Coefficient of variation (CV) | 0.57577251 |
| Kurtosis | -1.2108859 |
| Mean | 38504.629 |
| Median Absolute Deviation (MAD) | 19612 |
| Skewness | -0.017028223 |
| Sum | 2.9248463 × 1010 |
| Variance | 4.9150476 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8692 | 1086 | 0.1% |
| 29483 | 1086 | 0.1% |
| 52116 | 1086 | 0.1% |
| 25234 | 1086 | 0.1% |
| 35034 | 1086 | 0.1% |
| 41758 | 1086 | 0.1% |
| 17273 | 1086 | 0.1% |
| 13084 | 1086 | 0.1% |
| 19472 | 1086 | 0.1% |
| 18120 | 1086 | 0.1% |
| Other values (29758) | 748749 |
| Value | Count | Frequency (%) |
| 4 | 3 | < 0.1% |
| 7 | 8 | |
| 9 | 3 | < 0.1% |
| 11 | 3 | < 0.1% |
| 25 | 1 | < 0.1% |
| 26 | 19 | |
| 28 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 77095 | 1 | < 0.1% |
| 77094 | 60 | |
| 77088 | 14 | < 0.1% |
| 77087 | 4 | < 0.1% |
| 77085 | 14 | < 0.1% |
| 77083 | 11 | < 0.1% |
| 77078 | 1 | < 0.1% |
| 77076 | 7 | < 0.1% |
| 77072 | 2 | < 0.1% |
| 77069 | 53 |
review_scores_rating
Real number (ℝ)
| Distinct | 54 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 88.139768 |
| Minimum | 20 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 60 |
| Q1 | 80 |
| median | 90 |
| Q3 | 100 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 80 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 13.423344 |
|---|---|
| Coefficient of variation (CV) | 0.15229612 |
| Kurtosis | 6.1944853 |
| Mean | 88.139768 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | -2.0437751 |
| Sum | 66951761 |
| Variance | 180.18617 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 225341 | |
| 80 | 125109 | |
| 90 | 60398 | 8.0% |
| 93 | 33046 | 4.4% |
| 95 | 24243 | 3.2% |
| 87 | 23592 | 3.1% |
| 96 | 21353 | 2.8% |
| 60 | 19800 | 2.6% |
| 92 | 15768 | 2.1% |
| 89 | 15388 | 2.0% |
| Other values (44) | 195571 |
| Value | Count | Frequency (%) |
| 20 | 6516 | |
| 27 | 20 | < 0.1% |
| 30 | 49 | < 0.1% |
| 40 | 10651 | |
| 45 | 3 | < 0.1% |
| 47 | 138 | < 0.1% |
| 48 | 59 | < 0.1% |
| 50 | 991 | 0.1% |
| 52 | 1086 | 0.1% |
| 53 | 384 | 0.1% |
| Value | Count | Frequency (%) |
| 100 | 225341 | |
| 99 | 3309 | 0.4% |
| 98 | 11465 | 1.5% |
| 97 | 13194 | 1.7% |
| 96 | 21353 | 2.8% |
| 95 | 24243 | 3.2% |
| 94 | 13044 | 1.7% |
| 93 | 33046 | 4.4% |
| 92 | 15768 | 2.1% |
| 91 | 13208 | 1.7% |
review_scores_comfort
Real number (ℝ)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.0526811 |
| Minimum | 2 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 9 |
| median | 10 |
| Q3 | 10 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.4072309 |
|---|---|
| Coefficient of variation (CV) | 0.15544908 |
| Kurtosis | 7.3092783 |
| Mean | 9.0526811 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -2.374577 |
| Sum | 6876498 |
| Variance | 1.9802989 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 389432 | |
| 9 | 191509 | |
| 8 | 109788 | 14.5% |
| 6 | 33994 | 4.5% |
| 7 | 17598 | 2.3% |
| 2 | 9064 | 1.2% |
| 4 | 5969 | 0.8% |
| 5 | 2187 | 0.3% |
| 3 | 68 | < 0.1% |
| Value | Count | Frequency (%) |
| 2 | 9064 | 1.2% |
| 3 | 68 | < 0.1% |
| 4 | 5969 | 0.8% |
| 5 | 2187 | 0.3% |
| 6 | 33994 | 4.5% |
| 7 | 17598 | 2.3% |
| 8 | 109788 | 14.5% |
| 9 | 191509 | |
| 10 | 389432 |
| Value | Count | Frequency (%) |
| 10 | 389432 | |
| 9 | 191509 | |
| 8 | 109788 | 14.5% |
| 7 | 17598 | 2.3% |
| 6 | 33994 | 4.5% |
| 5 | 2187 | 0.3% |
| 4 | 5969 | 0.8% |
| 3 | 68 | < 0.1% |
| 2 | 9064 | 1.2% |
review_scores_amenities
Real number (ℝ)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.041291 |
| Minimum | 2 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 9 |
| median | 10 |
| Q3 | 10 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.3751226 |
|---|---|
| Coefficient of variation (CV) | 0.15209361 |
| Kurtosis | 6.0896619 |
| Mean | 9.041291 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -2.1648872 |
| Sum | 6867846 |
| Variance | 1.8909621 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 384767 | |
| 9 | 189878 | |
| 8 | 109769 | 14.5% |
| 6 | 30938 | 4.1% |
| 7 | 25844 | 3.4% |
| 4 | 6670 | 0.9% |
| 2 | 6201 | 0.8% |
| 5 | 5439 | 0.7% |
| 3 | 103 | < 0.1% |
| Value | Count | Frequency (%) |
| 2 | 6201 | 0.8% |
| 3 | 103 | < 0.1% |
| 4 | 6670 | 0.9% |
| 5 | 5439 | 0.7% |
| 6 | 30938 | 4.1% |
| 7 | 25844 | 3.4% |
| 8 | 109769 | 14.5% |
| 9 | 189878 | |
| 10 | 384767 |
| Value | Count | Frequency (%) |
| 10 | 384767 | |
| 9 | 189878 | |
| 8 | 109769 | 14.5% |
| 7 | 25844 | 3.4% |
| 6 | 30938 | 4.1% |
| 5 | 5439 | 0.7% |
| 4 | 6670 | 0.9% |
| 3 | 103 | < 0.1% |
| 2 | 6201 | 0.8% |
review_scores_trip
Real number (ℝ)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.0470571 |
| Minimum | 2 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 9 |
| median | 10 |
| Q3 | 10 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.5232991 |
|---|---|
| Coefficient of variation (CV) | 0.1683751 |
| Kurtosis | 6.6676864 |
| Mean | 9.0470571 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -2.4104593 |
| Sum | 6872226 |
| Variance | 2.3204402 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 411299 | |
| 9 | 180026 | |
| 8 | 90890 | 12.0% |
| 6 | 29118 | 3.8% |
| 7 | 20843 | 2.7% |
| 4 | 13465 | 1.8% |
| 2 | 10767 | 1.4% |
| 5 | 3138 | 0.4% |
| 3 | 63 | < 0.1% |
| Value | Count | Frequency (%) |
| 2 | 10767 | 1.4% |
| 3 | 63 | < 0.1% |
| 4 | 13465 | 1.8% |
| 5 | 3138 | 0.4% |
| 6 | 29118 | 3.8% |
| 7 | 20843 | 2.7% |
| 8 | 90890 | 12.0% |
| 9 | 180026 | |
| 10 | 411299 |
| Value | Count | Frequency (%) |
| 10 | 411299 | |
| 9 | 180026 | |
| 8 | 90890 | 12.0% |
| 7 | 20843 | 2.7% |
| 6 | 29118 | 3.8% |
| 5 | 3138 | 0.4% |
| 4 | 13465 | 1.8% |
| 3 | 63 | < 0.1% |
| 2 | 10767 | 1.4% |
review_scores_crew
Real number (ℝ)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.1330895 |
| Minimum | 2 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 9 |
| median | 10 |
| Q3 | 10 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.4460103 |
|---|---|
| Coefficient of variation (CV) | 0.15832652 |
| Kurtosis | 7.5445028 |
| Mean | 9.1330895 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -2.5006075 |
| Sum | 6937577 |
| Variance | 2.0909458 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 440561 | |
| 9 | 159370 | 21.0% |
| 8 | 90224 | 11.9% |
| 6 | 36786 | 4.8% |
| 7 | 14003 | 1.8% |
| 2 | 9853 | 1.3% |
| 4 | 6486 | 0.9% |
| 5 | 2240 | 0.3% |
| 3 | 86 | < 0.1% |
| Value | Count | Frequency (%) |
| 2 | 9853 | 1.3% |
| 3 | 86 | < 0.1% |
| 4 | 6486 | 0.9% |
| 5 | 2240 | 0.3% |
| 6 | 36786 | 4.8% |
| 7 | 14003 | 1.8% |
| 8 | 90224 | 11.9% |
| 9 | 159370 | 21.0% |
| 10 | 440561 |
| Value | Count | Frequency (%) |
| 10 | 440561 | |
| 9 | 159370 | 21.0% |
| 8 | 90224 | 11.9% |
| 7 | 14003 | 1.8% |
| 6 | 36786 | 4.8% |
| 5 | 2240 | 0.3% |
| 4 | 6486 | 0.9% |
| 3 | 86 | < 0.1% |
| 2 | 9853 | 1.3% |
review_scores_location
Real number (ℝ)
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.3581461 |
| Minimum | 2 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 9 |
| median | 10 |
| Q3 | 10 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.1169967 |
|---|---|
| Coefficient of variation (CV) | 0.1193609 |
| Kurtosis | 9.8498643 |
| Mean | 9.3581461 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -2.7231754 |
| Sum | 7108532 |
| Variance | 1.2476817 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 474944 | |
| 9 | 169755 | 22.3% |
| 8 | 78401 | 10.3% |
| 6 | 16076 | 2.1% |
| 7 | 9768 | 1.3% |
| 4 | 8653 | 1.1% |
| 2 | 1805 | 0.2% |
| 5 | 207 | < 0.1% |
| Value | Count | Frequency (%) |
| 2 | 1805 | 0.2% |
| 4 | 8653 | 1.1% |
| 5 | 207 | < 0.1% |
| 6 | 16076 | 2.1% |
| 7 | 9768 | 1.3% |
| 8 | 78401 | 10.3% |
| 9 | 169755 | 22.3% |
| 10 | 474944 |
| Value | Count | Frequency (%) |
| 10 | 474944 | |
| 9 | 169755 | 22.3% |
| 8 | 78401 | 10.3% |
| 7 | 9768 | 1.3% |
| 6 | 16076 | 2.1% |
| 5 | 207 | < 0.1% |
| 4 | 8653 | 1.1% |
| 2 | 1805 | 0.2% |
review_scores_price
Real number (ℝ)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.6967848 |
| Minimum | 2 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 8 |
| median | 9 |
| Q3 | 10 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 8 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.4170067 |
|---|---|
| Coefficient of variation (CV) | 0.16293455 |
| Kurtosis | 5.1504235 |
| Mean | 8.6967848 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -1.8956897 |
| Sum | 6606156 |
| Variance | 2.007908 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 253004 | |
| 10 | 241909 | |
| 8 | 175424 | |
| 6 | 40786 | 5.4% |
| 7 | 26930 | 3.5% |
| 4 | 9912 | 1.3% |
| 2 | 8090 | 1.1% |
| 5 | 3461 | 0.5% |
| 3 | 93 | < 0.1% |
| Value | Count | Frequency (%) |
| 2 | 8090 | 1.1% |
| 3 | 93 | < 0.1% |
| 4 | 9912 | 1.3% |
| 5 | 3461 | 0.5% |
| 6 | 40786 | 5.4% |
| 7 | 26930 | 3.5% |
| 8 | 175424 | |
| 9 | 253004 | |
| 10 | 241909 |
| Value | Count | Frequency (%) |
| 10 | 241909 | |
| 9 | 253004 | |
| 8 | 175424 | |
| 7 | 26930 | 3.5% |
| 6 | 40786 | 5.4% |
| 5 | 3461 | 0.5% |
| 4 | 9912 | 1.3% |
| 3 | 93 | < 0.1% |
| 2 | 8090 | 1.1% |
number_of_reviews
Real number (ℝ)
| Distinct | 358 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.297036 |
| Minimum | 1 |
|---|---|
| Maximum | 578 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 3 |
| Q3 | 9 |
| 95-th percentile | 45 |
| Maximum | 578 |
| Range | 577 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 23.048411 |
|---|---|
| Coefficient of variation (CV) | 2.238354 |
| Kurtosis | 70.719239 |
| Mean | 10.297036 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 6.736901 |
| Sum | 7821721 |
| Variance | 531.22926 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 244576 | |
| 2 | 116762 | |
| 3 | 60017 | 7.9% |
| 4 | 49592 | 6.5% |
| 5 | 33117 | 4.4% |
| 6 | 23197 | 3.1% |
| 7 | 18693 | 2.5% |
| 8 | 17732 | 2.3% |
| 9 | 14408 | 1.9% |
| 11 | 12588 | 1.7% |
| Other values (348) | 168927 |
| Value | Count | Frequency (%) |
| 1 | 244576 | |
| 2 | 116762 | |
| 3 | 60017 | 7.9% |
| 4 | 49592 | 6.5% |
| 5 | 33117 | 4.4% |
| 6 | 23197 | 3.1% |
| 7 | 18693 | 2.5% |
| 8 | 17732 | 2.3% |
| 9 | 14408 | 1.9% |
| 10 | 11357 | 1.5% |
| Value | Count | Frequency (%) |
| 578 | 4 | |
| 529 | 3 | |
| 507 | 4 | |
| 501 | 1 | < 0.1% |
| 481 | 4 | |
| 471 | 1 | < 0.1% |
| 468 | 1 | < 0.1% |
| 467 | 2 | |
| 461 | 1 | < 0.1% |
| 456 | 1 | < 0.1% |
reviews_per_month
Real number (ℝ)
| Distinct | 899 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.83372731 |
| Minimum | 0.01 |
|---|---|
| Maximum | 16.56 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 0.05 |
| Q1 | 0.16 |
| median | 0.4 |
| Q3 | 1 |
| 95-th percentile | 3.14 |
| Maximum | 16.56 |
| Range | 16.55 |
| Interquartile range (IQR) | 0.84 |
Descriptive statistics
| Standard deviation | 1.1109912 |
|---|---|
| Coefficient of variation (CV) | 1.3325594 |
| Kurtosis | 10.780633 |
| Mean | 0.83372731 |
| Median Absolute Deviation (MAD) | 0.3 |
| Skewness | 2.7770527 |
| Sum | 633306.77 |
| Variance | 1.2343014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.06 | 28993 | 3.8% |
| 0.04 | 20840 | 2.7% |
| 0.09 | 20085 | 2.6% |
| 0.11 | 16456 | 2.2% |
| 0.2 | 16083 | 2.1% |
| 0.07 | 15663 | 2.1% |
| 0.24 | 14789 | 1.9% |
| 1 | 13604 | 1.8% |
| 0.13 | 12844 | 1.7% |
| 0.03 | 12598 | 1.7% |
| Other values (889) | 587654 |
| Value | Count | Frequency (%) |
| 0.01 | 9 | < 0.1% |
| 0.02 | 246 | < 0.1% |
| 0.03 | 12598 | |
| 0.04 | 20840 | |
| 0.05 | 6892 | 0.9% |
| 0.06 | 28993 | |
| 0.07 | 15663 | |
| 0.08 | 10374 | 1.4% |
| 0.09 | 20085 | |
| 0.1 | 11083 | 1.5% |
| Value | Count | Frequency (%) |
| 16.56 | 6 | |
| 15.69 | 3 | < 0.1% |
| 14.13 | 3 | < 0.1% |
| 13.26 | 1 | < 0.1% |
| 12.6 | 4 | |
| 12.59 | 1 | < 0.1% |
| 12.5 | 1 | < 0.1% |
| 12.39 | 8 | |
| 12.21 | 2 | < 0.1% |
| 12.12 | 2 | < 0.1% |
id_y
Real number (ℝ)
| Distinct | 15354 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26968.063 |
| Minimum | 4 |
|---|---|
| Maximum | 50094 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 6838 |
| Q1 | 22721 |
| median | 29647 |
| Q3 | 29647 |
| 95-th percentile | 42615 |
| Maximum | 50094 |
| Range | 50090 |
| Interquartile range (IQR) | 6926 |
Descriptive statistics
| Standard deviation | 9058.3928 |
|---|---|
| Coefficient of variation (CV) | 0.33589334 |
| Kurtosis | 1.1296658 |
| Mean | 26968.063 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.66148146 |
| Sum | 2.0485183 × 1010 |
| Variance | 82054480 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 29647 | 383358 | |
| 28828 | 29808 | 3.9% |
| 32203 | 24288 | 3.2% |
| 20334 | 23881 | 3.1% |
| 4745 | 12996 | 1.7% |
| 18077 | 10625 | 1.4% |
| 10711 | 10476 | 1.4% |
| 19019 | 9792 | 1.3% |
| 18459 | 9216 | 1.2% |
| 15004 | 7744 | 1.0% |
| Other values (15344) | 237425 |
| Value | Count | Frequency (%) |
| 4 | 1 | < 0.1% |
| 9 | 4 | |
| 19 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 22 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 26 | 2 | < 0.1% |
| 30 | 1 | < 0.1% |
| 36 | 9 | |
| 41 | 4 |
| Value | Count | Frequency (%) |
| 50094 | 1 | < 0.1% |
| 50089 | 4 | < 0.1% |
| 50085 | 1 | < 0.1% |
| 50080 | 9 | < 0.1% |
| 50078 | 1 | < 0.1% |
| 50074 | 64 | |
| 50072 | 9 | < 0.1% |
| 50071 | 1 | < 0.1% |
| 50070 | 1 | < 0.1% |
| 50063 | 1 | < 0.1% |
company_rating
Real number (ℝ)
| Distinct | 64 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.9844897 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 1745 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.92 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.068364596 |
|---|---|
| Coefficient of variation (CV) | 0.069441657 |
| Kurtosis | 113.81699 |
| Mean | 0.9844897 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -9.4497057 |
| Sum | 747827.24 |
| Variance | 0.0046737181 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 611179 | |
| 0.99 | 44585 | 5.9% |
| 0.97 | 16237 | 2.1% |
| 0.93 | 12897 | 1.7% |
| 0.98 | 12873 | 1.7% |
| 0.96 | 8211 | 1.1% |
| 0.94 | 7572 | 1.0% |
| 0.9 | 6720 | 0.9% |
| 0.95 | 6463 | 0.9% |
| 0.89 | 6141 | 0.8% |
| Other values (54) | 26731 | 3.5% |
| Value | Count | Frequency (%) |
| 0 | 1745 | |
| 0.1 | 13 | < 0.1% |
| 0.11 | 1 | < 0.1% |
| 0.13 | 3 | < 0.1% |
| 0.17 | 9 | < 0.1% |
| 0.2 | 23 | < 0.1% |
| 0.22 | 3 | < 0.1% |
| 0.25 | 96 | < 0.1% |
| 0.29 | 6 | < 0.1% |
| 0.3 | 36 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 611179 | |
| 0.99 | 44585 | 5.9% |
| 0.98 | 12873 | 1.7% |
| 0.97 | 16237 | 2.1% |
| 0.96 | 8211 | 1.1% |
| 0.95 | 6463 | 0.9% |
| 0.94 | 7572 | 1.0% |
| 0.93 | 12897 | 1.7% |
| 0.92 | 2415 | 0.3% |
| 0.91 | 3612 | 0.5% |
company_location
Categorical
| Distinct | 159 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.6 MiB |
| Peru | |
|---|---|
| Niger | 35830 |
| Isle of Man | 31868 |
| Barbados | 27328 |
| Nicaragua | 20023 |
| Other values (154) |
Length
| Max length | 51 |
|---|---|
| Median length | 4 |
| Mean length | 6.5122306 |
| Min length | 4 |
Characters and Unicode
| Total characters | 4946749 |
|---|---|
| Distinct characters | 57 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 24 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Niue |
|---|---|
| 2nd row | Niue |
| 3rd row | Niue |
| 4th row | Niue |
| 5th row | Niue |
Common Values
| Value | Count | Frequency (%) |
| Peru | 383592 | |
| Niger | 35830 | 4.7% |
| Isle of Man | 31868 | 4.2% |
| Barbados | 27328 | 3.6% |
| Nicaragua | 20023 | 2.6% |
| Uzbekistan | 19161 | 2.5% |
| Sao Tome and Principe | 14834 | 2.0% |
| Croatia | 14721 | 1.9% |
| Uganda | 14529 | 1.9% |
| Zimbabwe | 12553 | 1.7% |
| Other values (149) | 185170 |
Length
| Value | Count | Frequency (%) |
| peru | 383592 | |
| niger | 35830 | 3.9% |
| of | 31881 | 3.5% |
| isle | 31868 | 3.4% |
| man | 31868 | 3.4% |
| barbados | 27328 | 3.0% |
| and | 23108 | 2.5% |
| nicaragua | 20023 | 2.2% |
| uzbekistan | 19161 | 2.1% |
| sao | 14834 | 1.6% |
| Other values (206) | 304526 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 632156 | |
| r | 583148 | |
| a | 511180 | 10.3% |
| u | 463620 | 9.4% |
| P | 409579 | 8.3% |
| i | 281353 | 5.7% |
| n | 249384 | 5.0% |
| o | 189771 | 3.8% |
| s | 171437 | 3.5% |
| 164410 | 3.3% | |
| Other values (47) | 1290711 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3902388 | |
| Uppercase Letter | 868433 | 17.6% |
| Space Separator | 164410 | 3.3% |
| Open Punctuation | 5473 | 0.1% |
| Close Punctuation | 5473 | 0.1% |
| Other Punctuation | 545 | < 0.1% |
| Decimal Number | 26 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 632156 | |
| r | 583148 | |
| a | 511180 | |
| u | 463620 | |
| i | 281353 | |
| n | 249384 | 6.4% |
| o | 189771 | 4.9% |
| s | 171437 | 4.4% |
| d | 121553 | 3.1% |
| l | 108391 | 2.8% |
| Other values (16) | 590395 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 409579 | |
| M | 66117 | 7.6% |
| N | 61882 | 7.1% |
| I | 49516 | 5.7% |
| U | 41113 | 4.7% |
| B | 36452 | 4.2% |
| S | 31674 | 3.6% |
| C | 29706 | 3.4% |
| T | 28882 | 3.3% |
| G | 18377 | 2.1% |
| Other values (13) | 95135 | 11.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 544 | |
| ' | 1 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 13 | |
| 0 | 13 |
Space Separator
| Value | Count | Frequency (%) |
| 164410 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5473 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5473 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4770821 | |
| Common | 175928 | 3.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 632156 | |
| r | 583148 | |
| a | 511180 | |
| u | 463620 | |
| P | 409579 | 8.6% |
| i | 281353 | 5.9% |
| n | 249384 | 5.2% |
| o | 189771 | 4.0% |
| s | 171437 | 3.6% |
| d | 121553 | 2.5% |
| Other values (39) | 1157640 |
Common
| Value | Count | Frequency (%) |
| 164410 | ||
| ( | 5473 | 3.1% |
| ) | 5473 | 3.1% |
| & | 544 | 0.3% |
| 6 | 13 | < 0.1% |
| 0 | 13 | < 0.1% |
| ' | 1 | < 0.1% |
| - | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4946749 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 632156 | |
| r | 583148 | |
| a | 511180 | 10.3% |
| u | 463620 | 9.4% |
| P | 409579 | 8.3% |
| i | 281353 | 5.7% |
| n | 249384 | 5.0% |
| o | 189771 | 3.8% |
| s | 171437 | 3.5% |
| 164410 | 3.3% | |
| Other values (47) | 1290711 |
total_fleet_count
Real number (ℝ)
| Distinct | 81 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 716.91878 |
| Minimum | 1 |
|---|---|
| Maximum | 1484 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 60 |
| median | 1305 |
| Q3 | 1305 |
| 95-th percentile | 1305 |
| Maximum | 1484 |
| Range | 1483 |
| Interquartile range (IQR) | 1245 |
Descriptive statistics
| Standard deviation | 616.04032 |
|---|---|
| Coefficient of variation (CV) | 0.85928885 |
| Kurtosis | -1.9648316 |
| Mean | 716.91878 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.08520072 |
| Sum | 5.4457796 × 108 |
| Variance | 379505.68 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1305 | 383358 | |
| 198 | 29808 | 3.9% |
| 176 | 24288 | 3.2% |
| 171 | 23881 | 3.1% |
| 108 | 16846 | 2.2% |
| 119 | 12996 | 1.7% |
| 139 | 10625 | 1.4% |
| 1484 | 9792 | 1.3% |
| 2 | 9497 | 1.3% |
| 1 | 9230 | 1.2% |
| Other values (71) | 229288 |
| Value | Count | Frequency (%) |
| 1 | 9230 | |
| 2 | 9497 | |
| 3 | 6757 | |
| 4 | 5873 | |
| 5 | 4193 | |
| 6 | 4290 | |
| 7 | 3586 | 0.5% |
| 8 | 3859 | |
| 9 | 3881 | |
| 10 | 3701 | 0.5% |
| Value | Count | Frequency (%) |
| 1484 | 9792 | 1.3% |
| 1305 | 383358 | |
| 420 | 2376 | 0.3% |
| 419 | 1 | < 0.1% |
| 198 | 29808 | 3.9% |
| 185 | 48 | < 0.1% |
| 176 | 24288 | 3.2% |
| 171 | 23881 | 3.1% |
| 139 | 10625 | 1.4% |
| 130 | 3476 | 0.5% |
iata_approved
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.5 MiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 621688 | |
| True | 137921 | 18.2% |
Auto
The auto setting is an interpretable pairwise column metric of the following mapping:- Variable_type-Variable_type : Method, Range
- Categorical-Categorical : Cramer's V, [0,1]
- Numerical-Categorical : Cramer's V, [0,1] (using a discretized numerical column)
- Numerical-Numerical : Spearman's ρ, [-1,1]
This configuration uses the recommended metric for each pair of columns.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.| id_x | shuttle_location | shuttle_type | engine_type | engine_vendor | engines | passenger_capacity | cancellation_policy | crew | d_check_complete | moon_clearance_complete | price | company_id | shuttle_id | review_scores_rating | review_scores_comfort | review_scores_amenities | review_scores_trip | review_scores_crew | review_scores_location | review_scores_price | number_of_reviews | reviews_per_month | id_y | company_rating | company_location | total_fleet_count | iata_approved | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 63561 | Niue | Type V5 | Quantum | ThetaBase Services | 1.0 | 2 | strict | 1.0 | False | False | 1325.0 | 35029 | 63561 | 97.0 | 10.0 | 9.0 | 10.0 | 10.0 | 9.0 | 10.0 | 133 | 1.65 | 35029 | 1.0 | Niue | 4.0 | False |
| 1 | 63561 | Niue | Type V5 | Quantum | ThetaBase Services | 1.0 | 2 | strict | 1.0 | False | False | 1325.0 | 35029 | 63561 | 97.0 | 10.0 | 9.0 | 10.0 | 10.0 | 9.0 | 10.0 | 133 | 1.65 | 35029 | 1.0 | Niue | 4.0 | False |
| 2 | 63561 | Niue | Type V5 | Quantum | ThetaBase Services | 1.0 | 2 | strict | 1.0 | False | False | 1325.0 | 35029 | 63561 | 97.0 | 10.0 | 9.0 | 10.0 | 10.0 | 9.0 | 10.0 | 133 | 1.65 | 35029 | 1.0 | Niue | 4.0 | False |
| 3 | 63561 | Niue | Type V5 | Quantum | ThetaBase Services | 1.0 | 2 | strict | 1.0 | False | False | 1325.0 | 35029 | 63561 | 97.0 | 10.0 | 9.0 | 10.0 | 10.0 | 9.0 | 10.0 | 133 | 1.65 | 35029 | 1.0 | Niue | 4.0 | False |
| 4 | 53260 | Niue | Type V5 | Quantum | Banks, Wood and Phillips | 1.0 | 2 | strict | 1.0 | False | False | 1325.0 | 35029 | 53260 | 98.0 | 10.0 | 9.0 | 10.0 | 10.0 | 9.0 | 10.0 | 37 | 0.48 | 35029 | 1.0 | Niue | 4.0 | False |
| 5 | 53260 | Niue | Type V5 | Quantum | Banks, Wood and Phillips | 1.0 | 2 | strict | 1.0 | False | False | 1325.0 | 35029 | 53260 | 98.0 | 10.0 | 9.0 | 10.0 | 10.0 | 9.0 | 10.0 | 37 | 0.48 | 35029 | 1.0 | Niue | 4.0 | False |
| 6 | 53260 | Niue | Type V5 | Quantum | Banks, Wood and Phillips | 1.0 | 2 | strict | 1.0 | False | False | 1325.0 | 35029 | 53260 | 98.0 | 10.0 | 9.0 | 10.0 | 10.0 | 9.0 | 10.0 | 37 | 0.48 | 35029 | 1.0 | Niue | 4.0 | False |
| 7 | 53260 | Niue | Type V5 | Quantum | Banks, Wood and Phillips | 1.0 | 2 | strict | 1.0 | False | False | 1325.0 | 35029 | 53260 | 98.0 | 10.0 | 9.0 | 10.0 | 10.0 | 9.0 | 10.0 | 37 | 0.48 | 35029 | 1.0 | Niue | 4.0 | False |
| 8 | 51019 | Niue | Type V5 | Quantum | ThetaBase Services | 1.0 | 2 | flexible | 1.0 | False | False | 1260.0 | 35029 | 51019 | 92.0 | 10.0 | 9.0 | 10.0 | 10.0 | 9.0 | 9.0 | 10 | 0.15 | 35029 | 1.0 | Niue | 4.0 | False |
| 9 | 51019 | Niue | Type V5 | Quantum | ThetaBase Services | 1.0 | 2 | flexible | 1.0 | False | False | 1260.0 | 35029 | 51019 | 92.0 | 10.0 | 9.0 | 10.0 | 10.0 | 9.0 | 9.0 | 10 | 0.15 | 35029 | 1.0 | Niue | 4.0 | False |
| id_x | shuttle_location | shuttle_type | engine_type | engine_vendor | engines | passenger_capacity | cancellation_policy | crew | d_check_complete | moon_clearance_complete | price | company_id | shuttle_id | review_scores_rating | review_scores_comfort | review_scores_amenities | review_scores_trip | review_scores_crew | review_scores_location | review_scores_price | number_of_reviews | reviews_per_month | id_y | company_rating | company_location | total_fleet_count | iata_approved | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1863945 | 49839 | Rwanda | Type V5 | Nuclear | Banks, Wood and Phillips | 1.0 | 1 | flexible | 1.0 | True | False | 1195.0 | 2878 | 49839 | 100.0 | 10.0 | 10.0 | 10.0 | 10.0 | 10.0 | 10.0 | 2 | 2.0 | 2878 | 0.93 | Bosnia and Herzegovina | 3.0 | True |
| 1863946 | 49839 | Rwanda | Type V5 | Nuclear | Banks, Wood and Phillips | 1.0 | 1 | flexible | 1.0 | True | False | 1195.0 | 2878 | 49839 | 100.0 | 10.0 | 10.0 | 10.0 | 10.0 | 10.0 | 10.0 | 2 | 2.0 | 2878 | 0.93 | Bosnia and Herzegovina | 3.0 | True |
| 1863947 | 49839 | Rwanda | Type V5 | Nuclear | Banks, Wood and Phillips | 1.0 | 1 | flexible | 1.0 | True | False | 1195.0 | 2878 | 49839 | 100.0 | 10.0 | 10.0 | 10.0 | 10.0 | 10.0 | 10.0 | 2 | 2.0 | 2878 | 0.93 | Bosnia and Herzegovina | 3.0 | True |
| 1863998 | 62216 | Chad | Type F5 | Quantum | ThetaBase Services | 1.0 | 2 | moderate | 1.0 | True | False | 1169.0 | 31216 | 62216 | 100.0 | 10.0 | 10.0 | 10.0 | 10.0 | 10.0 | 10.0 | 1 | 1.0 | 31216 | 1.00 | Togo | 1.0 | True |
| 1864306 | 39094 | Russian Federation | Type F5 | Quantum | ThetaBase Services | 1.0 | 2 | strict | 1.0 | False | False | 1455.0 | 42904 | 39094 | 100.0 | 10.0 | 10.0 | 10.0 | 10.0 | 10.0 | 10.0 | 1 | 1.0 | 42904 | 0.70 | Russian Federation | 2.0 | True |
| 1864307 | 39094 | Russian Federation | Type F5 | Quantum | ThetaBase Services | 1.0 | 2 | strict | 1.0 | False | False | 1455.0 | 42904 | 39094 | 100.0 | 10.0 | 10.0 | 10.0 | 10.0 | 10.0 | 10.0 | 1 | 1.0 | 42904 | 0.70 | Russian Federation | 2.0 | True |
| 1864357 | 20330 | Uzbekistan | Type V5 | Quantum | ThetaBase Services | 1.0 | 2 | flexible | 1.0 | False | False | 1585.0 | 5701 | 20330 | 100.0 | 10.0 | 10.0 | 10.0 | 10.0 | 10.0 | 10.0 | 1 | 1.0 | 5701 | 1.00 | Costa Rica | 1.0 | True |
| 1864386 | 16445 | Nicaragua | Type V5 | Plasma | ThetaBase Services | 1.0 | 1 | flexible | 3.0 | False | False | 1715.0 | 13728 | 16445 | 100.0 | 10.0 | 10.0 | 10.0 | 10.0 | 10.0 | 10.0 | 3 | 3.0 | 13728 | 1.00 | Pakistan | 1.0 | False |
| 1864448 | 76469 | Bouvet Island (Bouvetoya) | Type V5 | Quantum | ThetaBase Services | 1.0 | 2 | moderate | 1.0 | False | False | 1520.0 | 41714 | 76469 | 100.0 | 10.0 | 10.0 | 10.0 | 10.0 | 10.0 | 10.0 | 1 | 1.0 | 41714 | 1.00 | Lebanon | 1.0 | False |
| 1864889 | 75780 | Russian Federation | Type V5 | Plasma | ThetaBase Services | 1.0 | 2 | strict | 1.0 | True | False | 2820.0 | 47766 | 75780 | 100.0 | 10.0 | 10.0 | 10.0 | 10.0 | 10.0 | 10.0 | 1 | 1.0 | 47766 | 1.00 | Uzbekistan | 1.0 | True |
Most frequently occurring
| id_x | shuttle_location | shuttle_type | engine_type | engine_vendor | engines | passenger_capacity | cancellation_policy | crew | d_check_complete | moon_clearance_complete | price | company_id | shuttle_id | review_scores_rating | review_scores_comfort | review_scores_amenities | review_scores_trip | review_scores_crew | review_scores_location | review_scores_price | number_of_reviews | reviews_per_month | id_y | company_rating | company_location | total_fleet_count | iata_approved | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 12 | 53 | Barbados | Type V5 | Plasma | ThetaBase Services | 1.0 | 2 | strict | 1.0 | True | False | 3509.0 | 29647 | 53 | 90.0 | 9.0 | 10.0 | 9.0 | 8.0 | 9.0 | 9.0 | 8 | 0.33 | 29647 | 1.0 | Peru | 1305.0 | False | 1086 |
| 111 | 484 | Barbados | Type V5 | Plasma | ThetaBase Services | 2.0 | 4 | strict | 3.0 | True | False | 3938.0 | 29647 | 484 | 80.0 | 10.0 | 10.0 | 10.0 | 10.0 | 8.0 | 8.0 | 1 | 0.07 | 29647 | 1.0 | Peru | 1305.0 | False | 1086 |
| 124 | 544 | Nicaragua | Type F5 | Plasma | ThetaBase Services | 4.0 | 8 | strict | 4.0 | True | False | 8332.0 | 29647 | 544 | 100.0 | 10.0 | 10.0 | 9.0 | 10.0 | 10.0 | 10.0 | 5 | 0.19 | 29647 | 1.0 | Peru | 1305.0 | False | 1086 |
| 140 | 609 | Barbados | Type V5 | Plasma | ThetaBase Services | 2.0 | 4 | strict | 3.0 | True | False | 4588.0 | 29647 | 609 | 80.0 | 10.0 | 10.0 | 10.0 | 10.0 | 10.0 | 8.0 | 1 | 0.19 | 29647 | 1.0 | Peru | 1305.0 | False | 1086 |
| 211 | 883 | Sao Tome and Principe | Type V5 | Plasma | ThetaBase Services | 6.0 | 14 | strict | 9.0 | True | False | 7708.0 | 29647 | 883 | 100.0 | 10.0 | 10.0 | 10.0 | 8.0 | 10.0 | 10.0 | 1 | 0.22 | 29647 | 1.0 | Peru | 1305.0 | False | 1086 |
| 357 | 1412 | Micronesia | Type V5 | Plasma | ThetaBase Services | 4.0 | 8 | strict | 5.0 | True | False | 6473.0 | 29647 | 1412 | 100.0 | 10.0 | 10.0 | 6.0 | 8.0 | 10.0 | 8.0 | 1 | 0.24 | 29647 | 1.0 | Peru | 1305.0 | False | 1086 |
| 495 | 1990 | Malta | Type V5 | Plasma | ThetaBase Services | 1.0 | 3 | strict | 2.0 | True | False | 2443.0 | 29647 | 1990 | 20.0 | 2.0 | 2.0 | 2.0 | 2.0 | 6.0 | 2.0 | 1 | 0.19 | 29647 | 1.0 | Peru | 1305.0 | False | 1086 |
| 554 | 2249 | Niue | Type V5 | Plasma | ThetaBase Services | 5.0 | 8 | strict | 5.0 | True | False | 4601.0 | 29647 | 2249 | 80.0 | 8.0 | 8.0 | 6.0 | 8.0 | 8.0 | 6.0 | 1 | 0.30 | 29647 | 1.0 | Peru | 1305.0 | False | 1086 |
| 569 | 2307 | Russian Federation | Type V5 | Plasma | ThetaBase Services | 1.0 | 2 | strict | 1.0 | True | False | 2677.0 | 29647 | 2307 | 95.0 | 10.0 | 10.0 | 9.0 | 10.0 | 10.0 | 9.0 | 11 | 0.38 | 29647 | 1.0 | Peru | 1305.0 | False | 1086 |
| 677 | 2786 | Rwanda | Type F5 | Plasma | ThetaBase Services | 2.0 | 4 | strict | 2.0 | True | False | 3561.0 | 29647 | 2786 | 100.0 | 10.0 | 10.0 | 9.0 | 9.0 | 10.0 | 10.0 | 7 | 0.76 | 29647 | 1.0 | Peru | 1305.0 | False | 1086 |